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1. Harvest 10ml of overnight S. cerevisiae in mid-log phase (OD600 = 1.0) and wash with DEPC-H20. 

2. Extract RNA with hot acid phenol as described (Ref: 1). 

3. Generate RNA-seq libraries. For the following workflow, the QuantSeq 3’ mRNASeq Library Prep Kit FWD for Illumina (Lexogen) was used. 
4. Sequence the single-end, 50 base pair reads. Here, an Illumina HiSeq 4000 sequencer was used. 

5. Download S. cerevisiae genome from UCSC (httpy//hgdownload.cse.uesc.edu/goldenPath/sacCer3/bigZips/). Here, version sacCer3 was used. 
6. Convert sacCer3.2bit to fasta format with "twoBitToFa" (http/hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/) 


./twoBitToFa sacCer3.2bit saccer3.fa 
7. Create index-files for the yeast genome sacCer3 for bowtie2 (Step 12). 
bowtie2-build saccer3.fa index/saccer3 


8. Generate "genes. gt{" file, containmg annotations for genome used (sacCer3), using the online repository http//genome.ucsc.edu/cgi-biv/hgTables. Here, the following options were 
chosem: clade: other, genome: Scerevisiae, assembly: Apr.2011(sacCer3), group: genes and predictions, track SGD, table: sgdGene, region: genome, output: gtf 


9. Build transcriptome using “‘genes.gtf” from Step 8 and the transcriptome index from Step 7 using tophat (httpsy/ccb. jhu.edw/sofiware/tophat/index. shtml), 
tophat -p 4 -G annotation/genes. gtf --transcriptome-index=index/saccer3 -o out/out! --no-novel-junes index/saccer3 data/wt_0 S46 L004 R1_001.fastq 
10. Create index-files for the yeast rRNAs for bowtie2 (*.bt2). 

bowtie2-build index/sc-rrna.fa index/saccerrma 


11. Remove the random primer sequence, adapter contamination, and low quality tails (example given from httpsy/www. lexogen.conyquantseq-data-analysis/ for brary kit used in 
step 3). 


polyA = 
for ease in* fastq; do cat ${sample} | bbmap/bbduk.sh in=stdin.fq out=$ {sample} _trimmed_clean ref-bbmap/resources/polyA. fa.gz,bbmap/resources/truseq. fa.g7 k=13 ktrim=r 
forcetrimleff=11 useshortkmers=t mink=5 qtrim=t trimg=10 minlength=20; done 


12. Align each fastq file to the rRNA index (index/saccerrma from Step 10) and save unaligned sequences to new fastq file using bowtie (https//sourceforge.net/projects/bowtie- 
bio/). This filters out ribosonal RNA reads. 


bowtie -v2 -p4 index/saccerrma data/ {sample} .fastq_trimmed_clean --un /data/{sample}_trim_saccerrra-unalign.fastq >/dev/null 

13. Create alignments with tophat of the filtered data. 

tophat -p 4 --transcriptome-index=index/saccer3--no-novel-juncs -o out/trim-rma index/saccer3 data/{sample}_trim_saccerrrma-unalign. fastq 

14. Filter the data based on their quality usmg MAPQ filtering. 

for fifin out/trim-rma/*.bam,; do echo "Running on this file: $ff"; samtools view -bq 50 $ff> $ffEmapq50.bam; done 

15. Create index files for each .bam file ftom Step 14 using samtools (http://www. htshb.org/). 

samtools index out/trin-rma/{sample}_accepted_hits.bam 

16. Extract counts for each sample using htseq-count (https//htseq.readthedocs.io/en/release_0.11.1/install html). 

htseq-count -fbam out/trm+rrna/ {sample} _accepted_hits.bam genes. gtf> out/trim-rma/{sample}_accepted_hits_count.txt 

17. Counts-based expression values are calculated using R (https://cran.r-project.org/binwindows/base/), RStudio (https://rstudio.conyproducts/rstudio/download/#download) and 


RTools (https//cran.r-project.org/bin/windows/Rtools/, BiocManager (https//bioconductor.org/install’) and Dseq2 
(http//bioconductor.org/packages/release/bioc/html/DESeq2. html). 


18. Generate combined “‘count’-files that contain the counts of each replica ofa ik n sample as well as the reference sample (here “wildtype”) as columns in a tab delimited txt 
document. It should contain the name of the samples in the first row. first co! designates the target gene region. The data matrix is of the size of (number of samples) x 
(number of gene regions). 


region name wt-rep#| wt-rep#2 wt-rep#3 samplel-rep1 samplel-rep2 sample1-rep3 
YALO6IW 102000 
LIE IE-E IGE E- 


19. Generate a “‘table’-file for each sample indexing each column of data. The column sample _name should match the names of the samples in the first row of the "count"-file from 
Step 18. The column condition allows DSEq2 to correctly identify replicas (Step 21). 


sample_name condition 
wt-rep#1 WT 
wt-rep#2 WT 
wt-rep#3 WT 
samplel-rep1 sample1 
samplel-rep2 sample] 
samplel-rep3 sample] 


20. InR, load the Dseq2 library, the combined counts-file from Step 18 and the table-file from Step 19. 


library(DESeq2) 
count_table <— read.deln(“combined_counts _wt-samplel. txt sep="\t’",header=TRUE,row. names= region 1 name”) 
sample_table — read.delin(“‘table_wt-sample1 txt”,sep=\t’” sheader=TRUE,row. names="sample name” 


21. Plot data and write RNAseq expression values to file. 
dds <—- DESeqDataSetFromMatrix(countData = count_table,colData = sample_table,design = ~ condition) 
dds <-DESeq(dds) 
res <— results(dds) 
aoe — MAGS ee 
plot ones t{MA(res, main = “mutant”, ylm=c(-2,2), xlab = “‘mean count”) 
table(as.data. ftame(resOrdered), ae ’,quote=F ALSE , file="out/wt_sample1_p-values.txt” 
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